This file is part of the replication packet for "A Low-Cost Information Nudge Increases Citizenship Application Rates Among Low-Income Immigrants".

The replication packet for the paper includes files and datasets that allow a user to recreate the estimates and analyses found in the paper. To protect the privacy of the study's participants, the data in the replication packet have been coarsened: the variables for age, income, years with a green card, and household size have been recoded as quartiles. As a result, some of the exact estimates from the paper will not be replicated. The packet includes a PDF of the tables expected from the coarsened data (expectedTables.pdf). Many of the analyses in analysis.do write their results to tex tables. The file replicatedTables.tex displays these tables once the correct directory is set in the tex file.
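
To make the coarsening concrete, the following is a minimal Python sketch of recoding a numeric variable into quartiles. It is an illustration only: the packet does not document the exact procedure used to build the coarsened variables, so the cut-point method, the function name to_quartiles, and the example values are all assumptions and not study data.

```python
# Illustrative sketch only: the cut-point method and all values below are
# assumptions, not the procedure or data used to build the replication file.
from statistics import quantiles


def to_quartiles(values):
    """Recode each value as its quartile (1-4) using the sample's own cut points."""
    # Three cut points split the sample into four groups.
    q1, q2, q3 = quantiles(values, n=4, method="inclusive")

    def bucket(v):
        if v <= q1:
            return 1
        if v <= q2:
            return 2
        if v <= q3:
            return 3
        return 4

    return [bucket(v) for v in values]


ages = [22, 25, 31, 38, 44, 47, 53, 60]  # made-up ages
age_quartile = to_quartiles(ages)
print(age_quartile)
```

Once a variable is stored this way, only its quartile is known for each participant, which is why analyses that use age, income, years with a green card, or household size can differ slightly from the published estimates.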

Some of the data files (such as the SIPP and ACS data) are publicly available and must be downloaded in order to reproduce estimates generated in the paper. 


The following table lists each table, figure, or calculation from the paper along with the data files and code files needed to reproduce it. 


Calculation                                                | Data Needed                               | Script
---------------------------------------------------------------------------------------------------------------------------------------------
Figure 1 - Map                                             | exampleGeoData.csv, NYC_shape_file        | map.R
Figure 2 - Effects of Fee Waiver, Bar Chart, Effects Chart | NNYFeeWaiverReplicationData.dta           | analysis.do, figures.R
Figure S5: Power Curve                                     | <NONE>                                    | power_analysis.R
Table S1: Descriptive                                      | NNYFeeWaiverReplicationData.dta           | analysis.do
Table S2: Balance Checks                                   | NNYFeeWaiverReplicationData.dta           | analysis.do
Table S3: Survey Response Checks                           | NNYFeeWaiverReplicationData.dta           | analysis.do
Table S4: Effect Estimates                                 | NNYFeeWaiverReplicationData.dta           | analysis.do
Table S5: Effect Estimates (Controlling for Time)          | NNYFeeWaiverReplicationData.dta           | analysis.do
Table S6: Subgroups 1                                      | NNYFeeWaiverReplicationData.dta           | analysis.do
Table S7: Subgroups 2                                      | NNYFeeWaiverReplicationData.dta           | analysis.do
Table S8: Effect Estimates (Multiple Imputation)           | NNYFeeWaiverReplicationData.dta           | analysis.do
Table S9: Comparison of the Sample of Registrants          | NNYFeeWaiverReplicationData.dta, ACS data | acs_prep.R, acs_sample_comparison.R
Table S10: Effect of Fee Waiver Notice on Fee Waiver Usage | NNYFeeWaiverReplicationData.dta           | analysis.do
Estimate of fee waiver population among LPRs               | sippp08putm2.dta, sippl08puw2.dta         | sipp_estimate.do



The following code files are used to produce the analysis, figures, and tables found in the paper. 

	analysis.do
	figures.R
	map.R
	sipp_estimate.do
	acs_prep.R
	acs_sample_comparison.R
	power_analysis.R

The following data files are included:

	nycPUMAList.csv - a list of PUMAs in NYC
	NNYFeeWaiverReplicationData.dta - DTA file containing data from the experiment
	exampleGeoData.csv - CSV file containing data similar to the geo-data used to create the map in the paper

The following data files will need to be downloaded before the ACS and SIPP analyses can be run:

	ACS
	psam_husa.csv (ACS)
	psam_husb.csv (ACS)
	psam_pusa.csv (ACS)
	psam_pusb.csv (ACS)

	SIPP
	sippp08putm2.dta (SIPP)
	sippl08puw2.dta  (SIPP)

	NYC Shape File
	NYC_shape_file (https://www1.nyc.gov/site/planning/data-maps/open-data/districts-download-metadata.page)
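
Before running the ACS, SIPP, or map scripts, it can help to confirm that the downloads listed above are in place. The following Python sketch is one hedged way to do that. The file names come from the list above, but the helper name missing_downloads and the assumption that all downloads sit in a single directory are illustrative and not part of the packet.

```python
# Illustrative helper, not part of the replication packet. It assumes all
# downloaded files were placed in one directory; adjust to your own layout.
from pathlib import Path

# File names as listed in this README, grouped by source.
REQUIRED_DOWNLOADS = {
    "ACS": ["psam_husa.csv", "psam_husb.csv", "psam_pusa.csv", "psam_pusb.csv"],
    "SIPP": ["sippp08putm2.dta", "sippl08puw2.dta"],
    "NYC shape file": ["NYC_shape_file"],
}


def missing_downloads(data_dir):
    """Return (source, name) pairs for required downloads not found in data_dir."""
    data_dir = Path(data_dir)
    missing = []
    for source, names in REQUIRED_DOWNLOADS.items():
        for name in names:
            # exists() is true for either a file or a folder with this name.
            if not (data_dir / name).exists():
                missing.append((source, name))
    return missing


# Example: check the current working directory and report what is absent.
for source, name in missing_downloads("."):
    print(f"missing {source} download: {name}")
```

An empty result means every listed download was found in the directory that was checked.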


The following additional files are included in the replication packet:

	README_feewaiver.txt - a general explanation of the files included in the replication packet

	replicatedTables.tex - a tex file that will open all of the tex tables created by the replication code

	expectedTables.pdf - a PDF containing the tables expected when the replication code is run on the coarsened data. Because the replication files use a coarsened dataset, the tables generated for some of the analyses will not match the tables in the paper exactly, although the estimates of the main effect remain stable. The tables created by the replication code should match the tables in this PDF.









